Corpus: est_wikipedia_2010_30K

Other corpora

3.6.3 Zipf's law for words with same first letter

Zipf's law restricted to words with first letter a, b, c, and d


Zipf's diagram for words of fixed length


Gnuplot diagram

Top words a-
rank frequency word
1 1042 aasta
2 933 aastal
3 828 aga
4 654 ajal
5 324 aastat
Top words b-
rank frequency word
1 49 bändi
2 37 bänd
3 36 b
4 24 baltisaksa
5 18 bolševike
Top words c-
rank frequency word
1 101 cm
2 12 ca
3 7 c
4 7 cum
5 2 cd
Top words d-
rank frequency word
1 159 detsember
2 135 detsembril
3 132 de
4 43 direktor
5 38 dünastia
48 msec needed at 2021-08-21 09:02